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Preface 



As part of a study of the feasibility and effect of teaching 
Navajo children to read in their own language first, we have col- 
lected a corpus of 11,128 sentences of spoken Navajo. The text 
consists of interviews conducted by twenty adult Navajos with over 
two hundred children, most of them six-year-olds. Full details of 
the collection and analysis of this corpus is given in Navajo Read- 
ing Study Progress Report No. 9« 

This present study reports on the results of the analysis of 
gprnpheme and unit frequencies in the text: it should be of value 

to those concerned with the developing interest in Navajo literacy. 



Bernard Spolsky 
Principal Investigator 




3 



I. INTRODUCTION 



Venezky, 1 in The Structure of English Orthography, makes a useful 
distinction between a grapheme and a functional unit * A grapheme 
is a single letter or character; a unit is one or more letters or 
characters that function as a spelling unit. Thus the Navajo word 
CH f IL would have five graphemes, G - H - f - I - L, but only three 
units, OH * - I - L. 

Grapheme counts should be of interest to those concerned with 
the printing of Navajo; unit counts should be of interest to those 
concerned with the spelling, and the teaching of reading, in Mfevajo. 

The Werner-Begishe alphabet used in this study is essentially a 
form of the Young and Morgan alphabet modified for computer use* 

The alphabets differ only as shown below. 





< i > — 


< LH > 




< ti > — 


< TL > 




«... cl ! > - — 


< TL' > 




< ' > 


< 7 > 




< d > 


< 8 > 


(High tor 


is v.ritten as a < 7 > after 


each high tone vowel graphe 


nasality i: 


written as an < 8 > before 


a vowel grapheme sequence i 



its entirety.) 

There are, then, 23 graphemes required to write Navajo in this a.l liabet . 



^Richard Venezky . The Structure of English Orthography . Mouton: Ihe Hague. 

1970 . _ " 
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A 

B 

C 

D 



E 

G 

H 

I 

J 



K 

L 

M 

N 

0 



S 

T 

W 

X 

Y 



2 

7 

8 



But, because there are a number of English loan words in the speech 
of Navajo six-year-olds (Gee Navajo Reading Study Progress Report No. 16), 
the graphemes < F >, < P >, < Q >, < R >, and < V > do occur in the text. 
Many of these words, especially personal and place names are, for the 
speaker, Navajo words. Thus, we can Say that to write Navajo as it is 
spoken by six-year-olds requires 29 graphemes — the 26 ordinarily found 
in English plus < ' >, < 7 >, and < 8 >. 

Grapheme frequencies for consonants are computed by combining the 
;quencies of all graphemes and defining the frequency of each grapheme 
as a percentage of that total. 

The functional units of Navajo are a bit more difficult to arrive 
at. To facilitate explanation here, we discuss consonantal, vowel, 
and diphthongal units in turn. 

Consonantal units . 

A relatively large number of Navajo units are digraphs or trigraphs. 
Larger units sharing a grapheme (or contiguous graphenes) with a smaller 
unit or grapheme will be counted, by the computer, as an occurrence of 
the latter. Thus, the < T > of < TS > will be counted as an occurrence 
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of < T >. Similarly, with the < TS* >, the < T >, the < 3 >, the < 1 > 
and the < TS > will he counted as occurrences of each cf those graphemes 
or units. Thus, to obtain the true frequency of such consonantal units, 
one must discount the other occurrences by computation. Simplifying 
things a bit, we would obtain the true frequency of the < T > units by 
finding the remainder of all < T > graphemes minus the number of < TS > 
and < TS 1 > units. 

< X > is something of a problem. One use of < X > is as an alter- 
nant to < H > after < 3 > — thus avoiding confusion between < S > < H > 
and < SII >. But the < X > is also used to mark heavier aspiration which 
is used to indicate intensification or a derogatory sense. < X > is 
only marginally phonemic; two or three minimal pairs can be found* We 
have counted both uses of < X > as an instance of the first unit and 
< X >.* The transcriber seems to have been disturbed by the use of stem- 
initial < H > after a prefix-final < H >. Here she writes < HX >. This 
has been counted as an occurrence of both < H > and < X >. 

The transcriber did not write < *N > after consonantal units; 
thereby avoiding apparent three-consonant sequences which otherwise 
do not occur in normalized orthography e Only in such an environment 
is < f N > mechanically distinguishable from a sequence of < 1 > < M >. 
The same is the case with < *M > and < f Y >. 



*An inspection of all words containing < X > in the corpus showed that Navajo 

< X? occurred only in < TSX > and < CIIX > sequences# There were no < SX > 
sequences ; all apparent <C SX > sequences turned out to be sequences of 

< S > < X > as noted earlier, where the < X > replaces an < H >. Thus, 
the use of < X > < X > in this corpus is not as ambiguous as it could 
be. All occurrences of < X > were very small* 
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consonantal 


units 


of Navajo, 


then, are 


as follow; 


« 


DZ 


K 


S 


TS 


B 


G 


K' 


SH 


TS» 


GH 


GH 


KW 


T 


W 


CH 1 


H 


L 


T' 


X 


D 


HW 


LH 


TL 


Y 




J 


M 


TL* 


Z 



Vowels 

Both vowels and digraphs pose some problems* There are so many 
potential digraph sequences that the nesting of shorter sequences with- 
in longer ones becomes rather complex. Here it is easier to search 
for instances of given vowel or digraph sequences bounded by non- 
vowel graphemes (i.e*, consonants, space, or punctuation marks)* 

Thus, using * here to mark non- vowel, it is easier to search for a 
sequence of < *£I* > than to work out which other vowels and diph- 
thongs occur in sequences such as < El >, < EI7 >, < EI7I7 >» etc - 
The Young and Morgan orthography lends itself to an analysis of 
Navajo vowels in which any of four vowels may be short or long (in 
duration), oral or nasal (in nasality), and high or low f in tone). 

With the proviso that rising or falling tone occurs only with long 
vowels , one obtains 12 basic vowel types . 



V 


8v 


V7 


8V7 


w 


8w 


V7V7 


SV7V7 
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Given four basic vowels, < A >, < E >, < I >, and < 0 >, one obtains 
a matrix of ^+8 possible vowel sequences* Probably all of these do 
occur although some, like < 8 E? > are not at all common. 

Diphthongs 

Diphthongs are much more poorly defined* Young and Morgan list 
< A I >, < AO >, < El >, and < 01 > type diphthongs* But < 10 > type 
diphthongs do occur* 

Orthographically there are what might be called two-vowel and 
three-vowel diphthongs « But Hoijer (19^5) gives examples of a 
number of diphthongs which, due to normalization, are of types that 
would not occur in the Young and Morgan orthography: overtly marked 
nasal diphthongs, what might be called four-vowel diphthongs , and 
diphthongs with what might be called circumflex tone » 

Bather than arbitrarily decide which sequences should occur, we 
have simply tried to list all possible combinations which might 
occur — with the expectation that many of these vdll not occur. 

Unit frequencies are then computed by combining the frequencies 
of all consonants, vowels, and diphthongs together and computing the 
frequency of each unit as a percentage of that total. 

Children and *iult s 

Our initial analysis of grapheme frequencies showed a marked 
difference in the frequencies of < H > for children and adults — 

7.1 and 8.7 respectively. This may well reflect that in the 
interviews from which we derive our corpus, the adults are asking 
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most of the questions. Both -I7SH, the yes-no question marker and 
-SH as a reduced form of SH8A * , a generalized information question 
marker, contain < H>. As about a third of the corpus, by length, 
is adult speech, marked differences in adult unit frequencies might 
affect overall frequency figures. We have, therefore, computed 
grapheme and unit frequencies for children and adults separately. 
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II. GRAPHEMES 

Graphemes are single characters or letters. In the 
Werner-Begishe computer-compatible alphabet, there are 21 
Navajo letters and two diacritics — <7> marking high tone 
and <8]> marking nasality. An additional six letters occur 
in personal names and other English loanwords. 

Absolute Grapheme Frequencies 

There are more than 350,000 graphemes in the corpus. 

The frequency of the 29 graphemes is given in Table I. It 
should be realized that while those graphemes which occur 
only in English have been listed last, those graphemes which 
occur in Navajo as well as in English have been subsumed 
under the Navajo grapheme. The distinction is an arbitrary 
one; all 29 graphemes must be considered as part of current 
Navaj o orthography r . 

Relative Grapheme Frequencies 

Because there are a different number of graphemes for 
adults and children, it is a bit difficult to compare rela- 
tive frequencies. In Table II, the absolute grapheme fre- 
quencies have been converted to percentages . Due to rounding , 
and the fact that several graphemes have frequencies of less 
than 0.1%, the totals are a bit more than 100%. 
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TABLE I 



Absolute Grapheme Frequencies 





Adult 


Child 


Total 


f 


8,178 


12,223 


20,401 


A 


16,800 


26,683 


43,483 


B 


1,336 


2,704 


4,040 


C 


563 


1,488 


2,051 


D 


5,176 


10,077 


15,253 


E 


7,916 


14,575 


22,491 


G 


2,105 


3 ,138 


5,243 


H 


11,437 


15,511 


26,948 


I 


17,614 


28,041 


45,655 


J 


061 


1,548 


2,409 


K 


1,741 


3,200 


4,941 


L 


4,747 


8 ,587 


13,334 


M 


271 


740 


1,011 


N 


6 ,210 


9,931 


16,141 


0 


7,641 


15,361 


23,002 


S 


3,993 


5,923 


9,916 


T 


3,737 


5,678 


9,415 


W 


623 


1,110 


1,733 


X 


34 


101 


135 


Y 


1,901 


3,520 


5,421 


Z 


921 


1,729 


2,650 


7 


25,032 


41,246 


66,278 


8 


2,706 


4,651 


7,357 


F(E) 


63 


200 


263 


P(E) 


70 


319 


389 


Q (E) 


2 


3 


5 


R(E) 


328 


969 


1,297 


U(E) 


73 


249 


322 


V ( E) 


62 


105 


167 




132,141 


219,610 


351,751 
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TABLE II 

Relative Grapheme Frequencies 
Adult % Child % Total % 



1 


6.1 


5.6 


5.8 


A 


12.7 


12.2 


12.4 


B 


1.0 


1.2 


1.1 


C 


.4 


. 7 


.6 


D 


3.9 


4.6 


4.3 


E 


6.0 


6.6 


6.4 


G 


1.6 


1.4 


1.5 


H 


8.7 


7 . 1 


7.7 


I 


13.3 


12 8 


13.0 


J 


. 7 


. 7 


. 7 


K 


1.3 


1.5 


1.4 


L 


3.6 


3.9 


3.8 


M 


.2 


. 3 


. 3 


N 


4.7 


4.5 


4.6 


O 


5.8 


7.0 


6.5 


S 


3.0 


2.7 


2.8 


T 


2.8 


2.6 


2.7 


W 


.5 


. 5 


.5 


X 


>.l 


>.l 


>.l 


Y 


1.4 


1.6 


1.5 


Z 


. 7 


. 8 


. 7 


7 


18.9 


18.8 


18.8 


8 


2.0 


2.1 


2.1 


F (E) 


>.l 


.1 


.1 


P(E) 


>.l 


.1 


.1 


Q(E) 


>•1 


>.l 


>.l 


R(E) 


.2 


. 4 


. 3 


U (E) 


. 1 


.1 


. 1 


V(E) 


>.l 


w >. 1 


>. 1 
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Rank-Ordering of Graphemes 

To enable a closer comparison of the relative grapheme 
frequencies of adults and children, they are shown in order 
of descending relative frequency * ^able III. Here it may 
be seen that the rank-ordering of a 'ulcs and children is 
very close. 
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TABLE III 

Hank Ordering of Graphemes 





Adult 






Child 




Rank 


Unit 


% 


Rank 


Unit 


% 


1. 


7 


18.9 


1. 


7 


1 n 8 


2 . 


I 


13.3 


2 . 


I 


.2.2 


3. 


A 


12.7 


3. 


A 


1.2 


4. 


H 


8.7 


4. 


H 


-1 

_L 


5. 


f 


6.1 


5 . 


O 


1 0 


6 . 


E 


6.0 


6 . 


E 


6 .6 


7. 


O 


5.8 


7. 


t 


5.6 


8. 


N 


4 . 7 


8. 


D 


4 .6 


9 . 


D 


3.9 


9 . 


N 


4 . 5 


10 . 


L 


3.6 


10 . 


L 


3.9 


11. 


s 


3.0 


11. 


S 


2 . 7 


12 . 


T 


2.8 


12 . 


T 


2.6 


13. 


8 


2.0 


13. 


8 


2.1 


14. 


G 


1.6 


14. 


Y 


1.6 


15 . 


Y 


1.4 


15 . 


K 


1.5 


16. 


K 


1.3 


16. 


G 


1.4 


17. 


B 


1.0 


17. 


B 


1.2 


18. 


S 


.7 


18. 


Z 


. 8 


19. 


J 


. 7 


19 . 


J 


. 7 


20 . 


w 


. 5 


20. 


C 


. 7 


N) 

H 

• 


c 


. 4 


21. 


W 


. 5 


N) 

N) 

• 


R (E) 


.2 


22 . 


R (E) 


. 4 


23. 


M 


. 2 


23 . 


M 


.3 


24 . 


U 


. 1 


24. 


U 


.1 


25. 


P (E) 


> .1 


25. 


P (E) 


.1 


26 . 


F (E) 


7.1 


26 . 


F (E) 


.1 


27. 


V(E) 


>.l 


27 . 


V (E) 


7.1 


28. 


X 


>. 1 


28. 


X 


>,1 


29 . 


Q 


>.l 


29 . 


Q 


>.l 
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III. UNITS 

Units are single or multigrapheme sequences that function 
as units. Thus <0^ , and <(0 would be graphemes. <(ch) 



While there are more than 350,000 graphemes in the corpus, 
there are roughly 210,000 units — indicating (in the Werner- 
Begishe computer-compatible orthography) a relatively high 
frequency of multigrapheme units. 

Vowel -Consonan t Freque ncies 

Of the units in the text, roughly 45% are vowels and 
roughly 55% are consonants—seaming to indicate a prepon- 
derance of CV (rather than CVC) syllabels . 

The relative vowel consonant frequencies are shown in 
Table IV. 



and 




<^H^> and but not <(C/> are units in Navajo 



TABLE IV 



Relative V - <2 Frequencies 



Number 



% age 



Vowels 



95,620 



45.0% 



Consonants 



117,038 



55 . 0 % 



All Units 



212,658 
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III. A. 



VOWEL UNITS 



Errors: A total of 57 vowel units are considered to 

be errors. Some of these seem to have come from transcrip 
tion errors. Others seem to have come from word or mor- 
pheme separation errors. The result in both instances are 
sequences that we consider to be impossible Navajo vowel 
units . 

57 known errors among roughly 95,000 vowels is not 
enough to affect the overall percentages. 

The errors are shown in Table V. There has not been 
time to correct these errors. 
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TABLE V 

Vowel Unit Errors 



Transcription Errors : 

* AAI7I7 1 

* A7AA7 1 

* A7A7A 1 

* A7A7A7 1 

* All 1 

* E7I0I 1 

* E7I7U 1 

* IA7A 1 

* I7A7 3 

* I7E7I7 1 

* OOO 1 

* OOU 1 

* 07077 1 

* 08E7E7 1 

* 8I7IE7I 1 

* 8717 1 

Sub-total 18 

Separation Errors 

* 7 28 

* 7E7 1 

* 71 1 

* 717 4 

* 707 1 

* 8 4 

Sub-total 39 

TOTAL 57 
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English Vowe l Frequencies 

Of the roughly 95 , 000 vowel units, 630 units are 
definitely English. This is less than 1%. Such a 
figure is undoubtedly too low; it represents only those 
vowel units that do not occur in Navajo proper. 

Those English vowel units which do not occur in Navajo 
proper are shown in Table VI. The most frequently occur- 
ring word is shown in parenthesis. The fact that a number 
of these words are given names would seem to indicate that 
these English vowel units cannot be ignored in the teaching 
of Navajo orthography. 

In Tables VII and VIII, the use of English vowel units 
by adults and children are compared. While both percentages 
are quite low, the fact that the figure for the children 
is twice that of the adults would tend to confirm the 
notion that children use more English loan words than do 
adults . ^ 

‘'"Agnes Holm, Wayne Holm and Bernard Spolsky. "English Loan 
Words in the Speech of Six-year-old Navajo Children. Navajc 
Reading Study Progress Report No. 16. The University of 
New Mexico. August 1971. 
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TABLE VI 

English Vowel Units 

Vowels 

U 237 

Diphthongs 



AE 


(Mae) 


9 


AU 


(Laura) 


10 


EA 


(bear) 


112 


EO 


(George) 


14 


EU 


(Eugene ) 


4 


IA 


(Indian) 


30 


IE 


(Eddie) 


73 


10 


(lion) 


22 


OA 


??? 


42 


OE 


(potatoes) 


8 


OU 


(mouse) 


47 


OUI 


(Louis ) 


2 


UA 


(Juanita) 


2 


UE 


(blue) 


11 


UI 


(building) 


7 




Total 


630 



(l) T he counting program noted 42 occurrences of an OA unit 

but the exemplification program did not y '■.Id any examples. 
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TABLE VII 



Adult 


-Child Use of 


English Vowel 


Units 




Adult 


Child 


Total 


U 


46 


191 


237 


AE 


3 


6 


9 


AU 


4 


6 


10 


EA 


20 


92 


112 


EO 


3 


11 


14 


EU 


2 


2 


4 


IA 


11 


19 


30 


IE 


30 


43 


73 


10 


1 


21 


22 


OA 


8 


34 


42 (1 > 


OE 


1 


7 


8 


OU 


15 


33 


CN 

■‘w' 

00 

<3* 


OUI 


1 


1 


2 


UA 


-0- 


2 


2 


UE 


1 


8 


9 (2) 


UI 


3 


4 


7 




14 9 


480 


629 (2) 



See note with Table V. 

( 2 ) 

There are a few slight differences between the original 
count and the later search program. 





18 



TABLE VIII 

Adult - Child Percentage of use of_ English vowel Units 





Total 




Units 


Adult 


36,121 


Child 


59,499 


Combined 


95,620 



English 




Vowel 




Units 


Percentage 


149 


0.41% 


480 


0.81% 


629 


0.66% 
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Absolute Vowel and Diphthong Frequencies . 

The absolute frequencies of vowels and diphthongs are 
shown in Table IX. 

The forms preceded by an asterisk are definite errors . 
(see page 14) . Others may turn out, upon inspection, to 
be errors. The forms followed by an (E) are definite English 
forms. Those English forms identical to Navajo forms will 
have been subsumed under the Navajo forms. 
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TABLE IX 



Absolute Vowe l a nd Diphthong Frequencies 



Adult 



A 


4,490 


AA 


1,636 


AAI7 


28 


*AAI7I7 


1 


AA7 


4 


AE (E) 


3 


AI 


209 


All 


100 


AII7 


2 


AO 


9 


A00 


24 


AU (E) 


4 


A7 


3,346 


A7A 


506 


*A7AA7 


1 


A7A7 


1,267 


*A7A7A 


1 


*_a_7a7A7 


0 


A7I 


4 


A7II 


2 


Kill 


116 


A7I7I7 (?) 


1 


A70 


5 


A700 


0 


*A77 


1 


E 


1,433 


EA (E) 


20 


EE 


702 


EEI7 


1 


El 


87 


Eli 


133 


EII7 


30 


EI7 


1 


EO (E) 


3 


EU (E) 


2 


E7 


1,362 


E7E 


359 


*E7EI 


0 


E7E7 


397 


E7I 


168 


E7II 


33 


*E7lOI 


0 


E7I7 


1,017 


E7I7I7 (?) 


1 


*E7I7U 


0 


I 


4,176 


IA (E) 


11 


*IA7A 


1 



Child 


Total 


7,505 


11,995 


2,246 


3,882 


43 


71 


0 


1 


4 


8 


6 


9 


239 


448 


28 


128 


10 


12 


3 


12 


425 


449 


6 


10 


4,464 


7,810 


595 


1,101 


0 


1 


2,811 


4,078 


0 


1 


1 


1 


8 


12 


8 


10 


28 


144 


0 


1 


10 


15 


1 


1 


0 


1 


2,304 


3,737 


92 


112 


1,112 


1,814 


1 


2 


280 


367 


387 


520 


45 


75 


1 


2 


11 


14 


2 


4 


1,704 


3,066 


485 


844 


1 


1 


986 


1,383 


703 


871 


66 


99 


1 


1 


1,752 


2,769 


1 


2 


1 


1 


7,497 


11,673 


19 


30 


0 


1 




2,1 
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TABLE IX - continued 



IE ( ) 


Adult 

30 


Child 

43 


Total 

73 


II 


773 


2,212 


2,985 


117 


35 


17 


52 


10 ( ) 


1 


21 


22 


17 


3,345 


5,638 


8,983 


*I7A7 


1 


2 


3 


*I7E7I7 


0 


1 


1 


171 


247 


432 


679 


1717 


1,998 


1,311 


3,309 


0 


2,011 


3,031 


5,042 


OA ( ?) 


8 


34 


42 


OE ( ) 


1 


7 


8 


01 


7 


10 


17 


Oil 


2 


5 


7 


017 


1 


0 


1 


OI7I7 


1 


0 


1 


00 


959 


1,365 


2,324 


0017 


9 


9 


18 


*000 


0 


1 


1 


*00U 


0 


1 


1 


007 


39 


12 


51 


0U( ) 


15 


32 


47 


OUI( ) 


1 


1 


2 


07 


1,176 


1,613 


2,789 


071 


2 


9 


11 


0711 


4 


12 


16 


0717 


33 


87 


120 


070 


135 


97 


232 


0707 


812 


2,732 


3,544 


*07077 


1 


0 


1 


*08E7E7 


0 


1 


1 


U( ) 


46 


191 


237 


UA( ) 


0 


2 


2 


UE( ) 


2 


9 


11 


UI( ) 


3 


4 


7 


*7 


17 


11 


28 


* 7E7 


0 


1 


1 


*71 


1 


0 


1 


*717 


1 


3 


4 


*707 


1 


0 


1 


*8 


1 


3 


4 


8A 


605 


115 


720 


8AA 


79 


334 


413 


8A7 


93 


247 


340 


8A7A 


137 


190 


327 


8A7A7 


211 


491 


702 


SEE 


2 


6 


8 


8E7 


0 


2 


2 


8E7E 


96 


396 


492 


8E7E7 


238 


589 


827 


8E7I7 


0 


1 


1 


81 


127 


246 


373 
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TABLE IX - continued 





Adult 


Child 


Total 


811 


52 


144 


196 


817 


204 


393 


597 


8171 


82 


159 


241 


*8I7IE7I 


0 


1 


1 


8I7I7 


517 


660 


1,177 


80 


45 


73 


118 


800 


12 


70 


82 


807 


75 


87 


162 


8070 


15 


37 


52 


80707 


114 


406 


520 


*8717 


1 


0 


1 
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Relative Vowel and Diphthong Frequen cies 

The relative frequencies of vowels and diphthongs are 
shown in Table X. 

The comments made with respect to Table ix also apply to 
Table X. 
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TABLE X 

Relative Vowel and Diphthong Frequencies 





Adult 


Child 


Total 




Percentage 


Percentage 


Percentage 


*AAI7I7 


XI 


>. 1 


>.: 


*A7AA7 


J>. 1 


2.1 


>. 1 


* A7A7A 


A. 1 


> • 1 


>.- 


*A7A7A7 


>1 


>. 1 


2.1 


A7I7I7 


>. 1 


>.l 


2. 1 


A700 


A 1 


>.l 


2. l 


All 


>. 1 


>.l 


2. 1 


E7EI 


>.l 


XI 


2. 1 


*E7I0I 


>•1 


>•1 


2.1 


*E7I7U 


XI 


xl 


>. 1 


IA7A 


>. 1 


>.l 


2.1 


I7E7I7 


XI 


Xl 


2-1 


017 


XI 


X l 


> ' 


OI7I7 


XI 


>. i 


2. _ 


000 


>1 


>. l 


2 . ‘ 


OOU 


XI 


>. l 


>.- 


07077 


XI 


>.i 


2.. 


08E7E7 


XI 


>.i 


>. 


7E7 


2.1 


>. l 


>.l 


71 


XI 


>. l 


>.: 


707 


X 1 


>. l 


X: 


8E7I7 


XI 


2-1 


>.i 


8I7IE7I 


XI 


X 1 


>. i 


8717 


7. 1 


X 1 


X l 


OUI 


X 1 


>.l 


2.1 


El 7 


XI 


2.1 


X l 


8E7 


XI 


2.1 


Xl 


E7I7I7 


XI 


2.1 


Xl 


UA 


XI 


2. 1 


Xl 


EEI 7 


XI 


2.1 


7.i 


I7A7 


XI 


>.l 


7.1 


EU 


XI 


2.1 


>. l 


8 


XI 


>. 1 


>. 1 


111 


XI 


Xl 


Xl 


UI 


X. 1 


Xl 


7. 1 


Oil 


>.l 


Xl 


7. 1 


AA7 


>. 1 


Xl 


7.1 


8EE 


XI 


Xl 


2.1 


OE 


XI 


2. 1 


7.1 


AE 


XI 


Xl 


Xl 


AU 


21 


2.1 


2.i 


A7II 


2.1 


Xl 


Xl 


UE 


XI 


Xl 


2.1 


071 


7.1 


Xl 


7.1 


AO 


XI 


Xi 


Xi 


A7I 


XI 


Xl 


7.1 


All 7 


XI 


7.1 


Xl 


EO 


7.1 


2.1 


7.1 



O 
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TABLE X - continued 



A70 

C7II 

OI 

0017 

10 

7 

IA 

OA 

OU 

CO 7 

8070 

117 

AAI7 

IE 

EII7 

800 

E7II 

EA 

80 

0717 

All 

A7I7 

807 

811 

070 

U 

8171 
8A7A 
8A7 
El 
81 
8AA 
AI 
AOO 
8E7E 
80707 
Eli 
817 
17 1 
8A7A7 
8A 

8E7E7 

E7E 

E7I 

A7A 

8I7I7 

E7E7 

EE 



Adult 

Percentage 

> 1 

> - 

>!i 

>.i 
>i 
>i 
>. i 
>.i 
7. 1 
.1 
.1 
.1 
. 1 
.1 
.1 
.1 
.1 
.1 
.1 
.1 
. 3 
.3 
.2 
.1 
.4 
.1 
.2 
.4 
.3 
.2 
.4 
.2 
.6 
.1 
.3 
. 3 
.4 
. 6 
. 7 
.6 
1.7 
.7 
1.0 
.5 
1.4 
1.4 
1.1 
1.9 



Child 

Percentage 

>.l 
>• 1 
>.l 

7.1 

>.l 

7.1 
>. 1 

.1 
.1 
. 1 
.1 
.1 
.1 
.1 
.1 
. 1 
.1 
.2 
. 1 
.2 
.1 

.1 

o 

.2 
.2 
.3 
.3 
. 3 
.4 
.5 
.4 
.6 
.4 
.7 
.7 
. 7 
.7 
.7 
.7 
.8 
.2 
1.0 
. 8 

1.2 
1.0 
1.1 
1.7 
1.9 



Crtal 

Percentage 



1 



, 1 
.1 
. i 



.1 
.2 
.2 
.2 
.2 
.3 
. 3 
.3 
.4 
.4 
.4 
. 4 
. 5 
.5 
.5 
.5 
.5 
.6 
.7 
.7 
.8 
. 8 
.,9 
.9 
1.2 
1.2 
1.5 
1.9 



O 

ERIC 
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TABLE X - continued 





Adult 


Child 


Total 




Percentage 


Percentage 


Percentage 


00 


2.7 


2 . 3 


2.4 


E7-T7 


2. & 


2.9 


2.9 


C~ 


3.3 


2.7 


2.9 


*r -r 
X X 


2.1 


3.7 


3 . 1 




3.8 


2.9 


3.2 


1777 


5.5 


2.2 


3.4 


0707 


2.3 


4.6 


3.7 


E 


4.0 


3.9 


3.9 


AA 


4.5 


3.8 


4.0 


A7A7 


3.5 


4.7 


4 . 3 


0 


5 . 6 


5.1 


5.3 


A7 


9.3 


7.5 


8.2 


17 


9.3 


9.5 


9 . 4 


I 


11.6 


12.6 


12.2 


A 


12.4 


12.6 


12.5 
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Vowel - 2:. 

Cour. 

either 7 
sequence; 
thongs . 
The 

are show 



O 




u ong Frequencies 

r English vowels and diphthongs and errors as 
Is or diphthongs, more than 93% of all vowel 
--2 seen to be vowels; only about 7% are diph- 

lute and relative vowel and diphthong frequencies 
Table XI. 



TABLE XI 

Vowel Diphthong Frequencies 



Vowels 
2 iph thongs 



Number Percentage 

89,014 93.1 

6,606 6.9 

95 ,620 
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Vowel Types 

Vowels may be thought of as being of 12 basic types. 

The absolute and relative frequencies of these types are 
shown in Table XII. 

The relative frequencies are shown in terms of a per- 
centage of all vowels (but not diphthongs) . Several things 
may be seen from this table. More than a third of all vowels 
are unmarked; they are low, short, and oral. Almost half of 
all vowels have no sub- or superscript diacritics: i.e., 

they are low and oral (but may be either short or long) . 

The absolute and relative frequencies of these combinations 
are shown in Table XIII. Almost two-thirds of all vowels are 
short rather than long. The absolute and relative frequencies 
of short and long vowels are shown in Table XIV. 

A bit more than half of all vowels are low in tone. The 
absolute and relative frequencies of low, high, rising, and 
falling tone vowels are shown in Table XV. 

A and I combined make up more than two-thirds of all 
vowels. The absolute and relative frequencies of A, E, I, 

0, and U vowels are shown in Table XVI. 
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TABLE XII 
Vowel Types 






Number 


Percentage 


V 


32,684 (1) 


36.7 


w 


11,005 


12.4 


V 7 


22,648 


25.4 


V7V7 


12,314 


13.8 


W7 


111 


.1 


V7V 


2,856 


3.2 


8V 


1,211 


1.4 


8W 


699 


. 8 


8V7 


1,101 


1.2 


8V7V7 


3,226 


/- 

^> * D 


8W7 


-0- 


-0- 


8V7V 


1,112 


1.2 


other 


47 

89,014 
TABLE XIII 


. 1 


Marked - Unmarked Vowels 




Number 


Percentag' 


low H, short O , 


oral 32,684 


36.7 


high U/ long U, 


nasal 56,283 


63.2 


other 


47 


.1 




89,014 


100.0% 


low n, oral 


43,689 


49.1 


high U, nasal 


45,278 


50.9 


other 


47 

89,014 

32 


.1 
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TABLE XIV 

Short - Long Vowels 





Number 


Percentage 


short 


57,644 


64.8 


long 


31,323 


35.2 


other 


47 


.1 



89,014 



TABLE XV 
Vowe 1 Tone 





Number 


Percentage 


low 


45,599 


51.2 


high 


39,289 


44.1 


rising 


111 


.1 


falling 


3,968 


4.5 


other 


47 

89,014 
TABLE XVI 

Oral - Nasal Vowels 


. 1 




Number 


Percentage 


oral 


81,618 


91.7 


nasal 


7,349 


8.3 


other 


47 


.1 
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TABLE XVII 



Vowel Q uality 

Number Percentage 



A 


31,376 


35.2 


E 


12,173 


13.7 


I 


30,265 


34.0 


0 


14,916 


16.8 


U 


237 


.3 


other 


47 


. 1 




89,014 
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Vowel Frequencies 

The absolute rraquencies of all vowel units are given 
in Table XVIII. 

The relative frequencies , in terms of percentages of 

vowels and diphthong units combined, are given in Table 
XIX. 

And, finally, a comparison of adult and child rank 
ordering is given in Table XX. 
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TABLE XVIII 



Absolute Frequencies ; Vowel Units 





Adult 


Child 


Total 


A 


4,490 


7,505 


11,995 


AA 


1,636 


2,246 


3,882 


A7 


3,346 


4,464 


7,810 


A7A7 


1,267 


2,811 


4,078 


AA7 


4 


4 


8 


A7A 


506 


595 


1,101 


8A 


605 


115 


720 


8AA 


79 


334 


413 


SA7 


93 


247 


340 


8A7A7 


211 


491 


702 


8AA7 


-0- 


-0- 


-0- 


8A7A 


137 


190 


327 


E 


1,433 


2,304 


3,737 


EE 


702 


1,112 


1,814 


E7 


1,362 


1,704 


3,066 


E7E7 


397 


986 


1,383 


EE 7 


-0- 


-0- 


-0- 


E7E 


359 


485 


844 


8E 


-0- 


-0- 


-0- 


8EE 


2 


6 


8 


8E7 


-9- 


2 


2 


8E7E7 


238 


589 


827 


8EE7 


-0- 


-0- 


-0- 


8E7E 


96 


396 


492 


I 


4,176 


7,497 


11,673 


II 


773 


2,212 


2,985 


17 


3,345 


5,638 


8,983 


1717 


1,998 


1,311 


3,309 


117 


35 


17 


52 


1 71 


247 


432 


679 


81 


127 


246 


373 


811 


52 


144 


196 


817 


204 


393 


597 


81 717 


517 


660 


1,177 


8117 


-o- 


-0- 


-0- 


8171 


82 


159 


241 


0 


2,011 


3,031 


5,042 


00 


959 


1,365 


2,324 


07 


1,176 


1,613 


2,789 


0707 


812 


2,732 


3,544 


007 


39 


12 


51 


070 


135 


97 


232 


80 


45 


73 


118 


800 


12 


70 


82 


807 


75 


87 


162 


80707 


114 


406 


520 


8007 


-0- 


-0- 


- 0 - 


8070 


15 


37 


52 


U (E) 


46 


191 


237 
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y\ 

AA 

A7 

A7A7 

AA7 

A7A 

8A 

8AA 

8A7 

8A7A 

8AA7 

8A7A 

E 

EE 

E7 

E7E7 

EE7 

E7E 

8E 

8EE 

8E7 

8E7E 

8EE7 

8E7E 

I 

II 

17 

1717 

117 

171 

81 

811 

817 

8171 

8117 

8171 

0 

00 

07 

0707 

007 

070 

80 

800 

807 

8070 

8007 

8070 

U(E) 



TABLE XIX 
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Relative Frequencies : Vowel Units 

Adult Child Total 

Percentage Perce itage Percentage 



13.2 


13.6 


13.5 


4.8 


4.0 


4.4 


9 . 8 


8 . 


8 . 8 


3.7 


5 . _ 


4.6 


-0- 


-0- 


-0- 


1.5 


1.1 


1.2 


1.8 


.2 


. 8 


.2 


.6 


. 5 


. 3 


. 4 


.4 


.6 


.9 


.8 


-0- 


-0- 


-0- 


. 4 


.3 


.4 


4.2 


4.2 


4.2 


2.0 


2 . 0 


2.0 


4 . 0 


3.1 


3.4 


1.1 


1.8 


1.6 


-0- 


-0- 


-0- 


1.1 


.9 


.9 


-0- 


-0- 


-0- 


-0- 


-0- 


-0- 


-0- 


-0- 


-0- 


. 7 


1.1 


' o 


-0- 


-0- 


-0- 


. 3 


. 7 


.5 


12.3 


13.6 


13.1 


2.3 


4.0 


3.4 


9.9 


10.2 


10.1 


5.9 


2.4 


3.7 


. 1 


-0- 


.1 


.7 


.8 


. 8 


.4 


.4 


.4 


. 2 


.3 


.2 


. 6 


.7 


.7 


1.5 


1.2 


1.3 


-0- 


-0- 


-0- 


.2 


.3 


.3 


5.9 


5.5 


5.7 


2.8 


n . 5 


2.6 


3.5 


2.9 


3.1 


2.4 


5.0 


4.0 


.1 


-0- 


. 1 


.4 


.2 


.3 


.1 


.1 


.1 


-0- 


.1 


. 1 


.2 


.2 


.2 


.3 


. 7 


.6 


-0- 


-0- 


-0- 


-0- 


.1 


. 1 


.1 


.3 


.3 




TABLE XX 
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Bank Ordering ; 
Adult 



Bank Unit % 



!• A 13.2 

2 - I 12.3 

3. 17 9.9 

4 - A 7 9.8 

5 - 0 5,-9 • 

6. 1717 5.7 

7. AA 4.8 

8 - E 4.2 

9 - E7 4.0 

10. A7A7 3.7 

11 • 07 3 . 5 

12 • 00 2.8 

13. 0707 2.4 

14 - II 2.3 

15 • EE 2.0 

16- 8A 1.8 

17- A7A 1.5 

18. 8I7I7 1.5 

19- E7E7 1.1 

20. E7E 1.1 

21. 8E7E7 .7 

22 . 171 . 7 

23. 8A7A7 .6 

24- 817 .6 

25. 8A7A .4 

26. 81 .4 

27. 0707 .4 

28. 8A7 .3 

29. 8E7E .3 

30. 80707 .3 

31. 8AA .2 

32. 811 .2 

33. 8171 .2 

34. 807 .2 

35. H7 .1 

36. 007 .1 

37. 80 .1 

38. u(E) .1 

39. AA7 -0- 

40. 8AA7 -0- 

41. EE7 -0- 

42. 8E -0- 

43. 8EE -0- 

44. 8E7 -0- 

45. 8EE7 -0- 

46. 8117 -0- 

47. 800 -0- 

48. 8007 -0- 

49. 8070 -0- 



Vowel Units 



Child 


Bank 


Unit 


% 


1. 


A 


13.6 


2 . 


I 


13.6 


3. 


17 


10.2 


A 

• 


a 7 


8,1 


5. 


0 


5.5 


6. 


A7A7 


5.1 


7. 


0707 


5.0 


8. 


E 


4.2 


9. 


AA 


4.0 


10. 


II 


4.0 


11. 


E7 


3.1 


12. 


07 


2.9 


13. 


00 


2.5 


14. 


1717 


2.4 


15. 


EE 


2.0 


16. 


E7E7 


1.8 


17. 


81717 


1.2 


18. 


A7A 


1.1 


19. 


8E7E7 


1.1 


20. 


8A7A7 


.9 


21. 


E7E 


.9 


22. 


171 


.8 


23. 


8E7E 


.7 


24 . 


817 


.7 


25. 


80707 


.7 


26. 


8AA 


.6 


27. 


8 A7 


.4 


28. 


81 


. 4 


29. 


8A7A 


. 3 


30. 


811 


. 3 


31. 


8171 


. 3 


32. 


U(E) 


. 3 


33. 


8A 


.2 


34. 


070 


.2 


35. 


807 


.2 


36. 


80 


.1 


37. 


800 


.1 


38. 


8070 


.1 


39. 


AA7 


-0- 


40. 


8AA7 


-0- 


41. 


EE 7 


-0- 


42. 


8E 


-0- 


43. 


8EE 


-0- 


44. 


8E7 


-0- 


45. 


8EE7 


-0- 


46. 


117 


-0- 


47. 


8117 


-0- 


48. 


007 


-0- 


49. 


8007 


-0- 
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III. B. DIPHTHONG UNITS 
Diphthong Types 

As noted earlier, 7% of all vowel sequences are diph- 
thongs. The 6,606 diphthongs include 393 definite English 
diphthongs and at least 10 erroneous forms. 

The absolute and relative frequencies of diphthong types 
are given in Table XXI. Here it should be noted that a number 
of potential types do not occur and several that do occur 
turn out to be errors. Those marked with a (?) are 
sequences the computer failed to exemplify; those marked with 
a (*) are sequences later found to involve spelling errors. 

Thus, it turns out that there are no bona fide nasal diph- 
thongs, "four-vowel" diphthongs, or diphthongs with what one 
might call circumflex tone. Despite Hoijer's examples, it 
would seem that these can probably be discarded as real 
diphthong types. 

The absolute and relative frequencies of two-vowel and 
three-vowel diphthongs is given in Table XXII where it is 
seen that almost three quarters of all diphthongs are two- 
vowel diphthongs. 

The absolute and relative frequencies of low, high, 
rising, and falling tone are shown in Table XXIII where it is 
seen that almost half of all diphthongs are high. It may be noticed 
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in Table XXI that although, as suggested, tone on rising or 
falling diphthongs, tends to follow the division between the 
two vowels involved, there are the 87 V, forms which 

do not do so. 

The absolute and relative frequencies of oral and nasal 
vowels are shown in Table XXIV. It was noted earlier that 
the one nasal diphthong probably involves misspelling. 

There are, for all practical purposes, then, no overtly marked 
diphthongs . 

The absolute and relative frequencies of the different 
vowel combinations are given in Table XXV where it is seen 
that more than two-thirds of all diphthongs are E-I combina- 
tions. As E-I seems to result from the conjunction of 
underlying A and I, the two most common vowels, this is not 
surprising. No Navajo 1-0 sequences were found. They do 
occur; see for example, optative forms such as k'loodlaad 
in Young and Morgan (p. 52). 
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TABLE XXI 
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Diphthong Types 

Number Percentage 



V 1 V 2 


844 


12.8 


V ' 7 V 7 


3,033 


45.9 


V 1 V 2 7 


3 


>•1 


V 1 7V 2 


909 


13.8 


V 1 V 2 V 2 


1,104 


16.7 


V 1 7V 2 7V 2 7 


3 (?) (1) 


>.l 


V. V_7V„7 


1(*) (2) 


>.l 


1 Z z 






V 1 V 2 V 2 7 


87 


1.3 


V V 2 V 2 


126 


1.9 


V 1 V 1 V 2 


-0- 


-0- 


V 1 7V 1 7V 2 7 


-0- 


-0- 


V 1 V 1 V 2 7 


91 


1.4 


V 1 7V 1 7V 2 


-0- 




V 1 7V 1 V 2 


K ) (3) 


>•1 


SV 1 V 2 


-0- 




8V 1 7V 2 7 


1( ) (4) 


>•1 


English 


393 


5.9 


Errors 


10 


.2 



6,606 



"*■ ^ The exemplification program failed to produce examples of the 
A7I7I7 and E7I7I7 sequences the counting program had noted 

( 2 ) 

The 01 71 7 sequence involves an error in word separation. 
&NA7NI7SDZA7AGOI 7I7SHJ8A7A7 should be written as two words 
NA7NI7SDZA7AGO and I7I7SHJ8A7A7 . 

The E7EI sequence involves the misspelling of LE7I ' as *LE7EI' 

(4) . . 

The 8E7I7 sequence involves either an individual variant or a 

misspelling. *NI7i'8E7Eu' is usually pronounced, and spelled 

as NI7T ' 8E7E7 ' or NI7T'E7E7 ? . 
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